Development of a southern Swedish clustergen voice for speech synthesis
Abstract
This paper describes the development of a speech synthesis voice with a southern Swedish accent. The voice is built for the Festival speech synthesis system using the tools in the festvox suite. The voice type is clustergen, a statistical parametric synthesis method in which parametric models for phonemes, duration and pitch are all built from a labeled speech database.
Similar resources
CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling
Unit selection synthesis has shown itself to be capable of producing high quality natural sounding synthetic speech when constructed from large databases of well-recorded, well-labeled speech. However, the cost in time and expertise of building such voices is still too expensive and specialized to be able to build individual voices for everyone. The quality in unit selection synthesis is direct...
Adapting the Filibuster text-to-speech system for Norwegian bokmål
The Filibuster text-to-speech system is specifically designed and developed for the production of digital talking textbooks at university level for students with print impairments. Currently, the system has one Swedish voice, 'Folke', which has been used in production at the Swedish Library of Talking Books and Braille (TPB) since 2007. In August 2008 the development of a Norwegian voice (bokmå...
Optimizations and fitting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis
Every parametric speech synthesizer requires a good excitation model to produce speech that sounds natural. In this paper, we describe efforts toward building one such model using the Liljencrants-Fant (LF) model. We used the Iterative Adaptive Inverse Filtering technique to derive an initial estimate of the glottal flow derivative (GFD). Candidate pitch periods in the estimated GFD were then l...
Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One interesting topic in the multimedia domain is enabling computers to produce speech. Speech synthesis gives the computer the human ability to produce speech. The data-based approach and the process-based approach are the two main approaches to speech synthesis. Each approach has its own challenges. Unit-selection speech synthesis and statistical parametr...
Voice source properties of the speech code
This is an outline of the knowledge we need in order to include the voice source in an advanced model of speech production with applications to text-to-speech rules. Recent results from studies of the Swedish language provide information of source properties and source-vocal tract interaction as a function of the segmental and prosodic frame within an utterance and with reference to aerodynamic...